Minimizing Regret in Dynamic Decision Problems

نویسندگان

  • Joseph Y. Halpern
  • Samantha Leung
چکیده

The menu-dependent nature of regret-minimization creates subtleties in applying regret-minimization to dynamic decision problems. Firstly, it is not clear whether forgone opportunities should be included in the menu. We explain commonly observed behavioral patterns as minimizing regret when forgone opportunities are present, and also show how the treatment of forgone opportunities affects behavior in the classical secretary problem. Secondly, dealing with the dynamic inconsistency of non-Bayesian preferences requires techniques such as sophistication to be used in planning. Sophistication leads to even more options for the menu. We investigate different approaches to defining the menu, and the implications of each approach. Finally, we provide conditions under which dynamic consistency is guaranteed for a regret-minimizer.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sampling Based Approaches for Minimizing Regret in Uncertain Markov Decision Processes (MDPs)

Markov Decision Processes (MDPs) are an effective model to represent decision processes in the presence of transitional uncertainty and reward tradeoffs. However, due to the difficulty in exactly specifying the transition and reward functions in MDPs, researchers have proposed uncertain MDP models and robustness objectives in solving those models. Most approaches for computing robust policies h...

متن کامل

Weighted Sets of Probabilities and MinimaxWeighted Expected Regret: New Approaches for Representing Uncertainty and Making Decisions

We consider a setting where an decision maker’s uncertainty is represented by a set of probability measures, rather than a single measure. Measure-by-measure updating of such a set of measures upon acquiring new information is well-known to suffer from problems. To deal with these problems, we propose using weighted sets of probabilities: a representation where each measure is associated with a...

متن کامل

Stochastic p-robust location problems

The two most widely considered measures for optimization under uncertainty are minimizing expected cost and minimizing worst-case cost or regret. In this paper, we present a novel robustness measure that combines the two objectives by minimizing the expected cost while bounding the relative regret in each scenario. In particular, the models seek the minimum-expected-cost solution that is p-robu...

متن کامل

Strongly Adaptive Regret Implies Optimally Dynamic Regret

To cope with changing environments, recent developments in online learning have introduced the concepts of adaptive regret and dynamic regret independently. In this paper, we illustrate an intrinsic connection between these two concepts by showing that the dynamic regret can be expressed in terms of the adaptive regret and the functional variation. This observation implies that strongly adaptiv...

متن کامل

Regret in Dynamic Decision Problems

The paper proposes a framework to extend regret theory to dynamic contexts. The key idea is to conceive of a dynamic decision problem with regret as an intra-personal game in which the agent forms conjectures about the behaviour of the various counterfactual selves that he could have been. We derive behavioural implications in situations in which payoffs are correlated across either time or con...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015